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The phase sensitivity of interferometers is limited by the so-called Heisenberg limit, which states 
that the optimum phase sensitivity is inversely proportional to the number of interfering particles 
A'^, a 1/y/N improvement over the standard quantum limit. We have used simulated annealing, a 
global optimization strategy, to systematically search for quantum interferometer input states that 
approach the Heisenberg limited uncertainty in estimates of the interferometer phase shift. We 
compare the performance of these states to that of other non-classical states already known to yield 
Heisenberg limited uncertainty. 



I. INTRODUCTION 



An important aspect of quantum metrology is the en- 
gineering of quantum states with which to achieve mea- 
surements whose precision is Heisenberg limited. In this 
Hmit the measurement uncertainty is inversely propor- 
tional to the number of interfering particles N , repre- 
senting a improvement over the standard quantum 
hmit. Squeezed light has long been employed to beat the 
shot-noise limit [l|, and a growing body of theoret- 
ical literature indicates that the Heisenberg limit is in 
principle achievable using more exotic quantum states as 
interferometer inputs [3^0, S H, 0, 111- Several proof- 
of-principlc experimental realizations of such states have 
recently been carried out [1, [13, [HI, [H, [II] ■ Other pro- 
posals to beat the standard quantum limit involve the use 
of feedback schemes [3, [HI or multi-mode interferome- 
try [l^. The potential superiority of atomic fermions 
over bosons in some applications of atom interferome- 
try with quantum-degenerate atomic gases has also been 
pointed out [13, [11 ■ 



This paper summarizes the results of a systematic 
search for input quantum states that lead to Heisenberg 
limited intcrferometric detection of phase shifts. Using 
the global optimization method of simulated annealing 
we demonstrate the existence of numerous possibilities 
over-and-above those already proposed in the literature, 
and we evaluate and compare their performance. 



Section II discusses our theoretical model of a Mach- 
Zehnder interferometer used to measure the relative 
phase shift </> accumulated during the propagation of 
single-mode optical or matter waves along its two arms. 
Section HI introduces a likelihood function used to es- 
timate that phase and discusses its asymptotic form in 
the limit of many measurements. Section IV summarizes 
our main results obtained using simulated annealing and 
section V focuses on the prospects for the experimental 
realization of a quantum state of particular interest. Fi- 
nally, section VI is a summary and conclusion. 
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FIG. 1; (Color online) Schematic of a Mach-Zehnder interfer- 
ometer, with relative phase shift <j} resulting from propagation 
of an input field through its arms. 



II. MACH-ZEHNDER INTERFEROMETER 

We consider a Mach-Zchndcr interferometer with two 
input ports A and B, see Fig.[Tl characterized by bosonic 
annihilation and creation operators a and and b and 
b'^ , respectively. We restrict our investigation to a system 
with fixed particle number iV, in which case its proper- 
ties are conveniently described in terms of the angular 
momentum operators [3] 
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which obey the familiar commutation relations 
£ijkiJk, where Eijk is the Levi-Civita symbol, and 
J^,Ji ~ 0. Choosing z as the quantization axis we 
work in the basis of eigenstates common to and Jz, 

\j,m)z = \na)\nb). (5) 
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Here \ni) is a Fock state with particles in arm / = 
A, B. For brevity we drop the subscript z henceforth. 
The eigenvalues corresponding to and Jz arc j{j + 1) 
and m respectively, where 



J = {ua + nb)/2 = N/2 



(6) 



and 



The probability to measure |z) conditioned on that phase 
is P{i\^) = with 



N 
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m = {ua- nb)/2. 
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In terms of these operators, the propagation of the input 
fields through the interferometer - which consists of three 
unitary transformations describing an input 50/50 beam- 
splitter, the relative phase shift 0, and an output 50/50 
beam splitter ~ reads 



Bayes' theorem states that the probability that the 
phase shift has the value (f>, conditioned on the outcome 
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Our aim is to find input states 
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such that the uncertainty in the estimate of the phase </> 
is minimized. 

Our restriction in this paper to fixed total particle 
number leads to considerable analytical and computa- 
tional simplification, but the more general problem in 
which the average particle number is conserved is also 
of interest, and can in principle be carried out with the 
same techniques. 



III. LIKELIHOOD FUNCTION 

A number of approaches have been used as measures of 
the uncertainty in estimate of the relative phase </>. Com- 
monly the standard error propagation formula is used 
to express this phase uncertainty in terms of the mean 
square error of a measured observable such as the particle 
number difference llTl 



(11) 



Probability operator measures are also used [lj|, as well 
as information theoretical measures such as the Shannon 
mutual information [l^. In this paper we estimate the 
relative phase follo wing an operational approach based on 
Bayes' theorem 0, l20l. [2ll [22| . Consider an experiment 
in which the probability amplitude of the i'^ basis state 
of an TV -|- 1 dimensional Hilbert space depends on some 
phase (j), 



N 



|V') = 5]a.(0)|z) 



(12) 



where P{<j)) is the phase probability distribution prior to 
the measurement and P{i) is the prior detection proba- 
bility for the outcome i. Following a measurement with 
outcome ii, the phase probability distribution becomes 
P{(j)\ii), which may now be used as the prior phase prob- 
ability distribution for a second measurement Q , so that 



P{cl>\il,t2) = 



P(0Kl)P(*2|0) P(0)P(ii|0)P(i2| 
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Likewise, the phase probability distribution conditioned 
on the outcome of a sequence of M measurements is 



P(0|ii,Z2, ...,iM) = 



P{^)P{i,\4>)P{i2\4>)-P{ni\4>) 

P{ll)P{l2)...P{lM) 



(16) 

For a large number of measurements, M 3> 1, and as- 
suming that the true phase shift is = 0. the number 
of times a factor P{i\(f) appears in the product (|16|1 is 
approximately P{i\6)M. This motivates the introduc- 
tion of a likelihood function for the phase shift to be 0, 
conditioned on its true value being 9, as j2ll . [23 | 
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is a normalization constant. The likelihood function 
PM{<t>\(^) has the desirable property that it possesses a 
maximum at the true value, 0, of the phase shift. This is 
easily shown by taking its derivative 
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Evaluating Eq. at 6*, together with the normahzation 
condition gives then 
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implying an extremum at 9. Taking the second derivative 
and again using normalization shows this extremum to be 
a maximum. 

In order to estimate the phase uncertainty in the limit 
of large M we introduce the function 



Kim = iogP{m- 



(21) 



Expanding then K((j)\9) around (p = 9 and accounting for 
the normalization condition (fTS]) we find that P{(j)\9) is 
approximately given by 
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and 
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is the so-called Fisher information, as shown in Appendix 
A [2^. For large M the exponential suppresses strongly 
those contributions to P{4)\9) for which cj) > cr so that 
P{(j)\9) becomes Gaussian in that limit [20|. Asymptoti- 
cally, the likelihood function is therefore completely char- 
acterized by its variance, or equivalently by its Fisher 
information. Equation ([23]) also shows that the phase 
uncertainty decreases as the inverse square root of the 
number of measurements. 

The Fisher information plays an important role in in- 
formation theory as it gives a lower limit to the variance 
of any estimator via the Cramer-Rao inequality (23j 



var(.T) > 
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where var(a;) is the mean square error of the random 
variable x being estimated and Eq. (I^S)) is the defin- 
ing relation for the Fisher information. (Note also that 



the Fisher information of M' independent and identically 
distributed samples is M' times the individual Fisher in- 
formation.) Thus Eq. ([22]) indicates that the likelihood 
function Pm{4>\9) achieves the Cramer- Rao limit. It per- 
mits us to find input states of the interferometer of the 
form of Eq. pop that result in an estimate of the phase 
shift with minimum uncertainty. 

To illustrate how the likelihood function may be used 
to estimate a phase shift experimentally, consider a 
thought experiment using the Mach-Zehnder interferom- 
eter in Fig. 1. Each measurement counts the number of 
particles exiting the interferometer in arm "A" . Due 
to particle conservation, this is a direct measure of the 
quantum number m. Expanding the exit state of the field 



|V'out(<^)) = ^a,n{4')\3, 



(26) 



each measurement yields a specific particle number ria 
N/2 4- m'*^ with an associated phase probability distri- 
bution 



P(mW|0) 



(27) 



After M such measurements the conditional phase prob- 
ability distribution takes the form of Eq. p6p , which for 
sufficiently large M is a good approximation to the like- 
lihood function Eq. (fT7|) — up to the normalization con- 
stant as in Eq. psp . The maximum of this conditional 
phase probability distribution is an estimate of the phase 
shift and its variance gives the uncertainty. 

An important consideration is the number of measure- 
ments needed for the conditional phase probability dis- 
tribution, Eq. p6p . to be an accurate representation of 
the likelihood function. This matter is not addressed in 
this paper where we use throughout the assymptotic form 
of the likelihood function, but has been investigated by 
Braunstein 1241. 



IV. RESULTS 

This section summarizes results of a numerical search 
for optimum input states of the interferometer. This 
search employed the glob al optimization protocol of sim- 
ulated annealing [25l. l26j|. whose main features arc sum- 
marized in Appendix B. 
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FIG. 2; (Color online) (a) Probability amplitudes Om for the twin-Fock state (solid black line), the external NOON state (green 
dot-dashed line), the internal NOON state Eq. (31) (solid gray line) and the di-Fock state |i/'di(g = 1)) (red dotted line). 
Corresponding likelihood functions for (b) (M = 1) and (c) (M = 10). In the di-Fock state secondary peaks are already almost 
completely absent for M = 1, thus concentrating more probability density around (j) = 0. Secondary features are however 
suppressed for larger values of M in all cases, indicating that the twin-Fock state results in a slightly smaller phase uncertainty. 



To set the stage for this discussion, we first recall that 
several states have previously been proposed as good can- 
didates for Heisenberg limited interferometry. One such 
state is the balanced twin-Fock input state [y, Q 

|7^tw) = b,0) = |7V/2),|iV/2)b, (28) 

a state that we use as a benchmark in the following dis- 
cussion. It was suggested in Refs. 0, Q that improve- 
ments over that state can be achieved by using instead 
the state 

|Vdi(9)) = l/V2(|j,g) + |j,-g)), (29) 

with q = 1. This state, which we refer to as a di-Fock 
state in the following, presents the advantage of suppress- 
ing secondary peaks in the Ukelihood function, thus con- 
centrating more probability density around the true value 
of the phase shift. 

It has also been proposed that Heisenberg limited 
phase sensitivity can be achieved with the so-called NOON 
state [27t 

IVnoon) = \An{q = .?)> = l/\/2 (I.?, j) + |j, -j)) 

= l/V2{\N)a\0)b+\0)a\N)b). 

(30) 

Some disagreement exists in the current literature regard- 
ing the phase sensitivity of NOON states, with some au- 
thors claiming that it in fact obeys shot noise limited 
sensitivity Q. Mitchell et al. [l3| as well as Walther 
et al. [l2| have pointed out that NOON states may be 
used to produce super-resolving phase oscillations, with 
a period of 27r/iV, in interfcrometric measurements. In 
agreement with this we will show that similar oscillations 
occur in the likelihood function, if Eq. (30) describes the 
state of the system after the first beam splitter. To be ex- 
plicit we distinguish between external NOON states for 
which Eq. ()30p is the state before the first beam splitter, 
and internal NOON states for which Eq. (|5n|) is the state 
after the first beam splitter. The internal NOON state is 
equivalent to using an input state 

e-^J^-^/^lijmoN). (31) 



It can also be achieved by using as an input the state 
I a 1 0)6 and replacing the first beam splitter with a non- 
linear beamsplitter with appropriate interaction time, as 
shown in Q. 

Figurc[2)Ja) shows the probability amplitudes am of the 
basis states |j, m) for the input states of Eqs. (p8|) - (|29)) . 
as well as the external NOON and internal NOON states, 
for a system with = 20 particles, and a relative phase 
9 = between the two arms of the interferometer. The 
solid black line corresponds to the twin-Fock input, the 
dotted red line to the input state \ipdi{q = 1)), the green 
dot-dashed line to the external NOON state and the gray 
solid line to the internal NOON state . The correspond- 
ing likelihood functions for M = 1 are plotted in Fig. [2] 
(b). Apart from the internal NOON state the probabil- 
ity density is concentrated close to = in all cases, 
but the di-Fock state seems more favorable as it results 
in a narrow distribution with no significant secondary 
peaks. However, this apparent advantage rapidly disap- 
pears for larger M, in which case the secondary peaks as- 
sociated with the twin-Fock state are suppressed, leading 
to a slightly narrower distribution. Note the distribution 
corresponding to the external NOON state remains con- 
siderably wider than the other candidates, indicating a 
larger uncertainty in the phase estimate. The likelihood 
function of the internal NOON state rapidly oscillates with 
a period of 2tt/N radians. This is consistent with the A^- 
fold increase in phase oscillations observed in [13, [l^l ■ 

Figure [3] illustrates some of the large number of possi- 
ble input states numerically obtained from the simulated 
annealing algorithm for a system of = 100 particles. 
Column A plots the probability amplitudes of the 
input states; column B shows the corresponding likeli- 
hood functions with M = 1 (broken red broken lines) and 
compares them to the likelihood function of the bench- 
mark twin-Fock input (solid black lines); column C plots 
the situation for Af = 10 measurements. A remarkable 
feature of these results is that while these input states 
are very markedly different, their likelihood functions be- 
come almost indistinguishable for large M. Surprisingly 
perhaps the optimization procedure clearly shows the ex- 
istence of a large number of local minima resulting in 
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FIG. 3: (Color online) Column A: probability amplitudes for several possible input states with A'^ — 100 particles. Column 
B: Corresponding likelihood functions for M = 1 (dotted red line), compared to the result for the benchmark twin-Fock input 
(solid black line). Column C: Likelihood functions for (M — 10). 
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FIG. 4: (Color online) Scaling of the inverse square root 
Fisher information, a = I/x/F with particle number for 
the state in row 1 of Fig. 3 (blue dotted line), twin-Fock 
state (solid red line), the "gaussian state" (black dashed line) 
and the external NOON state (green dot-dashed line). The 
points are calculated using Eq. (|24|l while the lines corre- 
spond to least square fits, with the values external NOON state: 
cr = LO/iV^/^, "gaussian": a = 1.9/iV, twin-Fock: cr = 1.4/iV 
and state in row 1 of Fig. 3: ct = 1.0/A^. Note the log-log 
scale. 



almost identical likelihood functions. 

To demonstrate that all states identified by the simu- 
lated annealing algorithm indeed result in approximate 
Heisenberg limited phase sensitivity, Fig. 4 shows a log- 
log plot of the inverse square root Fisher information, 
1/\/F, as a function of particle number over a range of 
50 < < 2500. The solid red fine is for twin-Fock states, 
the green dot-dashed line for external NOON states, the 
dotted blue line for the uppermost state in Fig. [3l and 
the dashed black line for the state in row 2 of Fig. [3l For 
clarity we refer to the latter state as a "gaussian state" 
in the following. All lines are least square fits of the 
equation 



C 



(32) 



where C and f3 are fit parameters. 

Table 1 summarizes the results of this fit for the states 
of Fig. |3l the number referring to the row number in the 
figure. Up to differences of a few percent in the over- 
all proportionality constant C, all of these states clearly 
satisfy the 1 /N scaling characteristic of Heisenberg lim- 
ited sensitivity, the only notable exception being the ex- 
ternal NOON state, which (in agreement with Pezze and 
Smerzi 0) is shot-noise limited. On the other hand, 
the inverse square root Fisher information of the internal 
NOON state does scale with the Heisenberg limit. Despite 
this, the rapid oscillations in the likelihood function seen 
in Fig. 2(b) and (c) allow a phase estimate only mod- 
ulo 27r/7V. The consequent ambiguities imply that the 
internal NOON state may not be useful for phase estima- 
tion when using the current Bayesian analysis unless one 
has a priori knowledge that the phase shift lies within a 
particular phase-bin of width 2tt/N . We now discuss the 
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TABLE I: Scaling of the phase uncertainty as a function of 
the particle number A'^ for the the twin state, the external- 
and internal NOON states, the di-Fock state, and the input 
states of Fig. |3l 



candidates obtained by search algorithms in turn. 

The state with the highest Fisher information that we 
found, an apparent global optimum, is shown in row 1 
of Fig. [31 The envelope of its probability amplitudes 
is a Gaussian with width w = ^/N. Despite the high 
Fisher information of that state, though, it produces a 
significant ambiguity in the determination of the phase 
estimate as secondary peaks persist even for M — 10, as 
seen in column C. A feature not apparent on the scale in 
this figure is that the central peak is the absolute max- 
imum and becomes increasingly dominant for increased 
M. Yet, as in the case of the internal NOON state, the per- 
sistence of secondary peaks for relatively large sequences 
of measurements indicates that it may not be the most 
useful state in practice. 

In the case of the "gaussian state" , second row of Fig.[3l 
the probability amplitudes am have a Gaussian distribu- 
tion around the state \j, ?7i = 0) , 

am = exp [-?7i^/cr'^]/7V', 

where Af' is a normalization constant, and the standard 
deviation is a' = 1.7 for the example at hand. That state 
results in Heisenberg limited sensitivity for < cr' ^ j, 
with 



1 _ a' 



(33) 



The limit c' — > corresponding to the twin-Fock state. 

The state described in the third row of Fig. [3] is an 
example of a state we refer to as a tri-Fock state, and it 
has the form 

|Vtri(9)) = ia.\3,-q) + ao\],0) + a+\j,q)) /M' , (34) 

where Af' is a normalization constant. We find numeri- 
cally that it results in Heisenberg limited sensitivity for 
any value of q. 

The fourth row in Fig. [3] describes a state that is a 
superposition of four Fock states. For the — 100 par- 
ticles considered in our simulations, and the state in row 
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one of Fig. [3] aside, we have found states with superpo- 
sitions containing up to ~ 8 Fock states that result in 
Heisenberg- limited sensitivity. 

We also found that di-Fock states, Eq. ([^^ . with ar- 
bitrary q generally result in Heisenberg limited or near 
Heisenberg limited scaling for q as large as q < 0.98j. 
For larger q the state approaches the shot-noise limited 
external NOON state. 

Several general trends can be noted in the results of 
our search. First, we find that the scaling of Eq. (|32p de- 
pends only weakly on the relative probability amplitudes 
Q!M of the Fock states involved. Changing the relative 
amplitudes of these coefficients by factors as large as 3 
typically results in changes in the coefficient C by a few 
percent only. The Gaussian state is a notable exception 
to this trend, and obeys instead the scaling equation ([33)) . 

Second, we found no inherent symmetry in the input 
states that result in the Heisenberg limit. This is illus- 
trated by the states of rows 5-6 in Fig. [3l For example, 
the state of row 5 is an unbalanced twin- Fock state of the 
form 

I'^tw) = (35) 

where 7 is some fraction. We found numerically that 
the state resulted in Heisenberg limited sensitivity for 
0.02 < 7 < 0.98. 

All states shown in Fig. [3]have real amplitudes. Allow- 
ing for complex amplitudes of the same magnitudes re- 
tains Heisenberg limited or near Heisenberg limited scal- 
ing, with /3 > 0.95 in Eq. ^ and the coefficient C 
changed by only a few percent. Again the effect is more 
pronounced in the Gaussian state, where the change in 
C can be up to a factor of ~ 2. This is because that state 
has neighboring states occupied, and the number statis- 
tics of these states influence each other even for small 
phase shifts. 

Due to the existence of numerous input states resulting 
in nearly identical uncertainties within the measurement 
scheme presented here, a more relevant criterion for the 
selection of an appropriate input state is likely to be its 
ease of experimental realization. We address this point 
in some more detail in the next section for the case of the 
"gaussian state." 



V. THE GAUSSIAN STATE 

The "gaussian" input state is a promising candidate 
for Heisenberg limited interferometry for two reasons: (1) 
there is a simple experimental scheme available to gen- 
erate it; (2) as we show below, the expectation value of 
Jz is phase dependent in that case (as opposed to the 
situation for twin-Fock input states) providing an alter- 
native phase estimate to the direct measurement of the 
likelihood function, while still allowing Heisenberg lim- 
ited sensitivity. 



A. Number statistics 

As mentioned in section HI, the likelihood function 
can be experimentally reconstructed by multiplying the 
phase probability distributions P{i\(f>) associated with a 
sequence of measurement outcomes. This is the approach 
that was adopted in the simulations carried out in Refs. 
H, 0] for twin-Fock input states. In that case however 
the average particle number difference remains zero for 
all relative phase shifts between the interferometer arms, 
and is therefore not a useful observable. The same is 
true for the majority of the states that we identified in 
our numerical optimization search. One way to circum- 
vent this difficulty is to measure instead the variance of 
Jz, an approach that still results in Heisenberg limited 
estimates. However, as was pointed out in Ref. for 
the case of the twin-Fock state, the signal to noise ratio 

is then small, (Jf)/-y/(J4) - (J|)2 = V2. 

One advantage of using a gaussian input state instead 
is that (Jz) now depends on the relative phase 0. This is 
illustrated in Fig. 5(a)-(c), which shows the probability 
distribution P{m\(j)) at the exit of the interferometer for 
twin-Fock and gaussian input states, and for phase shifts 
(j) = 0, (j) ^ 7r/100 and (p = tt/IO. The expectation 
value of Jz for the Gaussian state is clearly not equal 
to zero for non-zero phase shifts, and may therefore be 
used directly to estimate that shift. The uncertainty in 
Jz , evaluated via the standard error propagation formula 
Eq. (jlip . is shown in Fig. 6 as a function of the number 
of interfering particles. Least square fits indicate that the 
uncertainties are Heisenberg limited, with Acj) — 2.0/N 
for the gaussian state and similarly A0 = 2A/N in the 
case of a tri-Fock state with q = l. 

B. Input state engineering 

We have seen that the constraints on the relative 
phases of the complex probability amplitudes ai of the in- 
put states are surprisingly weak when estimating phase 
shifts via a reconstruction of the likelihood function. 
However, the number statistics of the field after passage 
through the interferometer depend critically on these rel- 
ative phases. For example, if in the Gaussian state of Fig. 
[3]the components |j, —1) and |j, are it/2 radians out 
of phase with |j, 0), the output state \tpout) will be popu- 
lated symmetrically around |j, 0) independently of 0, so 
that {Jz) = for all </>. Hence, some care must be taken 
in preparing the input states IV'in)- 

It is possible to generate a gaussian input state from 
an initial twin-Fock state, Eq. ((28|l . by subjecting it to 
the Hamiltonian 

H^, = hgX, (36) 

where 5 is a coupling constant, for a time r = i-K /ANg. 
The resulting state is precisely the state shown in row 
2 of Fig. 3, but with the probability amplitudes of the 
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FIG. 5: (Color online) Number statistics for phase shifts oi (j) — 0, (p — tt/IOO and </> = tt/IO in a system with A'^ — 100 
particles for (a)-(c) a twin-Fock input state and (d)-(f) a "gaussian" input state. The twin-Fock state populates number states 
symmetrically around m = 0, leading to an expectation value of (Jz) = for all values of 0. This is not the case for a gaussian 
state. 
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FIG. 6: (Color online) Scaling of the uncertainty in the es- 
timate of the relative phase from a measurement of average 
particle number difference. The dashed black line is for a 
Gaussian state and the red solid line of a tri-Fock state, Eq. 
P4|) . with g = 1, and the green dot-dashed line represents 
the state engineered as described in section V(B). The points 
are numerically determined while the lines correspond to least 
square fits. Note the log-log scale. 

components \j, —1) and |j, tt/2 radians out of phase 
with the component \j,0). These three components can 
be brought into phase via an additional evolution under 
the Hamiltonian 

= fix (37) 

for a time r = 3tt/2x- The resulting state allows Hcisen- 
berg limited phase estimation by measuring (Jz), with 
the scaling law A0 = 3.2/ N as a function of the particle 
number N. We remark that while the time evolution |37|) 
of the input state brings the three main components of 
\ipin) in phase with each other, that is not so for the other, 
weakly populated number states that comprise it. This 



results in a somewhat reduced performance compared to 
Gaussian state with all components in phase. 

Hamiltonians of the form Eq. ((37|) have long been 
known to act as squeezing operators in interferometers 
[2^. In the case of photons, they can be implemented 
by inserting an optical Kerr medium into each arm of 
the interferometer [2^. In the case of charged parti- 
cles they arise due to mutual phase modulation from 
Coulomb interaction between particles in each arm [soj . 
Atomic spins coupled to the polarization of an optical 
field (biI . [3^ can lead to similar Hamiltonians for neutral 
atoms. 



VI. CONCLUSION 

We have used a global optimization scheme to system- 
atically search for input states of a quantum mechanical 
Mach-Zehnder interferometer that yield phase estimates 
with accuracy scaling like the inverse number of particles, 
the Hcisenberg limit. Surprisingly perhaps, we find that 
a large number of states can achieve that limit. They typ- 
ically consist of superpositions of a small number of Fock 
states, with few restrictions on the relative phase of their 
complex amplitudes on on their symmetry. An input 
state of particular relevance consists of a gaussian distri- 
bution of amplitudes around the state \N/2)a\N/2)b, due 
principally to its relative ease of realization with simple 
Hamiltonian dynamics. 



Acknowledgments 

We would like to thank Lajos Diosi for insightful con- 
versations regarding the relation between the likelihood 



9 



function and information theoretical concepts. This work Then 

is supported in part by the US Office of Naval Research, 

by the National Science Foundation, by the US Army 

Research Office, and by the National Aeronautics and 

Space Administration. d{SA0) 



(A2) 



APPENDIX A: FISHER INFORMATION 

Consider the deviation (SAg) of an observable from its 
mean value {A{9)) at a fixed phase 6, as a function of the 
phase shift (j), 



E(a,-(^W))P(,|0)i^^i^.(A3) 



(SAg) = ^ (flj — {A{9))) P{j\(j>). (Al) Regrouping the factors in Eq. (|A3|) and squaring gives 



d{SAg 



< 



logP(j|<^) 



r 



(A4) 
(A5) 



where we have used the Schwarz inequality in the last 
step. Noting that the second term in Eq. (|A2p vanishes 
due to normalization we have d{6Ag) /dcf) ~ d{A) / dcfi. We 
can then rewrite inequality (jASP and evaluate it at 9 to 
give the desired result: 



AA^ 



[d(A)/d<p\ 



> 



1 



Iog-P(3l0) I 
dtp I 



2 • 



(A6) 



This is the Cramer- Roa inequality, Eq. ([25|) . in the cur- 
rent context. The denominator on the right in Eq. (jA6P 
defines the Fisher information. 



APPENDIX B: SIMULATED ANNEALING 

Simulated annealing (25l . [26l | is a mathematical 
approach to global optimization simulating the metal- 
lurgical process whereby an amorphous compound is 
successively heated and cooled while gradually lowering 
the average temperature in an attempt to enlarge the 
grain size of single crystals in the compound. The 
protocol is as follows: 

1. Initial conditions for the optimization parameters, 
X = {xi, X2, ■■■Xr) are chosen, usually at random. 

2. With each allowed value of x is associated an pseudo- 
energy, E{x), which is the quantity to be minimized. 

3. A new value, x' = x + Ax, is then generated for 
the optimization parameters, and the change in energy 
AE = E{x') - E{x.) calculated. 



4. The new value of x always replaces the old if 
p = g-A_E/fcT ^ 1^ or with probability p if p < 1. 
Here fc is a constant of proportionality analogous to 
Boltzmann's constant in statistical mechanics, and T is 
a pseudo-temperature. 

5. The process is repeated. 

Accepting new parameter sets for which p < 1, i.e. up- 
hill steps on the energy manifold, allows the algorithm to 
explore the whole parameter space instead of converging 
directly to the closest local minimum. 

Two important considerations in this procedure are the 
method of choosing the next set of parameters x' and the 
annealing schedule, i.e. the protocol for gradually low- 
ering the temperature with intermittent heating cycles 
until the system has frozen into, hopefully, a global min- 
imum. Various approaches to these considerations have 
been discussed in the literature [1^, . 

In our implementation the optimization parameters are 
the set of amplitudes am of the input state vector Eq. 
(fTO)) and the energy the inverse square root of the Fisher 
information. We execute the simulated annealing algo- 
rithm not on a single vector a, but a population of vec- 
tors chosen at random. The pseudo-temperature of the 
system is set by the average energy of the population. 



T 



(Bl) 



where P is the number of state vectors in the popula- 
tion and Fi the Fisher information of the i^^ state vec- 
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tor. Defining the temperature in this way self-regulates 
the cooling cycle. If a single global minimum exists, the 
algorithm will continue sampling until the majority of 
state vectors have fallen into the global minimum. On the 
other hand if many local minima of comparable depth ex- 
ist the algorithm will also continue to sample the param- 
eter space until the majority of state vectors have found 
such local minima. As more local minima arc found the 
system "cools down" by itself. 

When the state vectors a have converged near the min- 
ima and the step size |Aa| = |a' — ol\ is fixed, all new 
steps a' will be uphill, thus halting further convergence. 
To enable further convergence we therefore half the step 
size when the number of downhill steps found over a sev- 
eral iterations drops below a threshold. 

It may also happen in our approach that the system 
reaches an equilibrium condition in which the average 
number of uphill steps accepted become equal to the av- 
erage number of downhill steps found. To ensure that 
the system continues to converge towards minima, we 
lower Boltzmann's constant by fc — > 0.9fc if this point is 
reached. We take as an indicator that the system is near 
this point whenever the number of accepted uphill steps 
is larger than the number of downhill steps in a given 
iteration cycle. 

To summarize the algorithm: 

1. Choose initial population at random and calculate 
pscudo-tcmpcraturc 

2. Find new state vectors a' = a + Aa and replaces 
the old if p = Q~^E/kT ^ or with probabihty p 
if p < 1 

3. If the number of uphill steps accepted is greater 
than number of downhill steps decrease k — > 0.9fc. 

4. If the number of downhill steps found is less than 
specified threshold reduce Aa 0.5Aa. 



5. Repeat algorithm 

We have implemented searches that assume either real 
or complex amplitudes. In addition, in some searches we 
imposed no restrictions on the symmetry of input states, 
while in others we forced the input states to be either 
symmetric or anti-symmetric around \N/2)a\N/2)i,. 

In the case of real amplitudes, the initial population 
was chosen by generating a random number between 
[— 1 , 1] for each and then normalizing the state vec- 
tor. For complex amplitudes the magnitudes \a„i\ were 
chosen at random between and 1 and a complex phase 
between and 27r. 

We have used two different approaches to specifying 
the new state vectors a' during each iteration. In the 
first one new state vectors were selected by changing 
each amplitude at random within an interval eam < 
a'^ < (1 -f e)Q!m, with e < 1 and then renormalizing 
the state vector. In the case of complex amplitudes a 
new phase was also chosen as (j)'^ = (pm + A(f> where 
— e < A(j> < e, while in the case of real amplitudes 
sign changes were allowed if \am\ < 0.02 by choosing 
— (l + e)|Q;„i| < a'm < (l-l-e)|am|- In this approach, step 
(4) in the algorithm described above was rather insensi- 
tive to the values chosen for e and e. They were therefore 
taken to have fixed values e = 0.5 and e = 0.05. 

In the second approach each vector a is specified by 
a set of angles such that the vector moves on a hyper- 
spherical surface of radius 1 to ensure normalization. The 
next vector is chosen in a random direction on the hyper- 
sphere with the initial step size A^ =0.1. It is decreased 
in successive iterations according to step (5) in the algo- 
rithm above. 

In the first approach the state vectors in the population 
settle in many local minima of comparable depths, while 
in the second approach all state vectors converge to an 
apparent global minimum which is the state shown in 
row 1 of column A in Fig. 3. 
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